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If a quantum system is subject to noise, it is possible to perform quantum error correction reversing the action 
of the noise if and only if no information about the system's quantum state leaks to the environment. In this 
article, we develop an analogous duality in the case that the environment approximately forgets the identity 
of the quantum state, a weaker condition satisfied by e-randomizing maps and approximate unitary designs. 
Specifically, we show that the environment approximately forgets quantum states if and only if the original 
channel approximately preserves pairwise fidelities of pure inputs, an observation we call the fidelity alternative. 
Using this tool, we then go on to study the task of using the output of a channel to simulate restricted classes of 
measurements on a space of input states. The case of simulating measurements that test whether the input state 
is an arbitrary pure state is known as equality testing or quantum identification. We establish that the optimal 
amortized rate at which quantum states can be identified through a noisy quantum channel is equal to the 
entanglement-assisted classical capacity of the channel, despite the fact that the task is quantum, not classical, 
and entanglement-assistance is not allowed. In particular, this rate is strictly positive for every quantum channel, 
including classical channels, even though the ability to identify cannot be cloned. 



I. INTRODUCTION 

Quantum channels in modern quantum information the- 
ory |Q]] are modeled as completely positive and trace- 
preserving maps Af ■ S(A) S(B) between the state spaces 
of quantum systems with Hilbert spaces A and B. The re- 
quirement of complete positivity means that Af is not just 
positive, mapping positive semidefinite operators to positive 
semidefinite operators, but that id <g) Af is positive for the 
identity map id on any S(R). This distinction plays a cen- 
tral role in the geometry of entanglement because positive but 
not completely positive maps can be used to identify entan- 
gled quantum states [2]. This paper will take as its starting 
point a similar observation about channel norms. 

The Stinespring dilation theorem establishes a fundamen- 
tal property of quantum channels: for every channel Af there 
exists an ancilla space E and an isometry V : A B <g) E 
such that Af{p) = tr E VpV* H. 

This means that quantum 
noise can always be interpreted as information loss in an oth- 
erwise deterministic evolution. Since E and V are essentially 
unique (up to unitary equivalence), each channel Af also has 
an associated complementary channel Af c : S(A) — > S(E), 
with Af c (p) = trs V pV\ which is uniquely defined up to 
coordinate changes of E. 

In quantum Shannon theoretic error correction we try to 
find two channels £ and T> (an encoder and decoder) such that 
T> o Af o £ « id. For now we shall consider the encoding £ 
fixed, so that Af o £ can be treated as a single channel. The 
central insight of quantum error correction [4-7] is that the 
existence of a decoding operation T> for a channel Af, i.e. 

VpeS(RA) \\(id®VoAf)p RA -p^ld < e, (1) 

is equivalent to the complementary channel being completely 



forgetful: for all Hilbert spaces R, 

Vp,<r e S(RA) || (id® Af c ) P RA - (id ®AS c )o- RA \\ < 5, 

(2) 

with a universal relation between e and 6. 

Here we determine a matching duality for the weaker 
property of the complementary channel being only (approx- 
imately) forgetful: 

Vp,<reS{A) WAf^p^-Af^a*)^ < 5. (3) 

That this is a much weaker property was noticed in the con- 
texts of approximate encryption and remote state prepara- 
tion |Hl @]. The difference between Eqs. (|2]i and (0 is pre- 
cisely the difference between two norms on superoperators, 
the naive one inherited from the trace norm, and the so- 
called completely bounded norm j7l [Tol[Tlll . Not surprisingly, 
Eq. ([3]) will hold provided the main channel approximately 
preserves the pairwise fidelities between input pure states, a 
property we call geometry preservation: 

V|^)»Gj4 ly-^-WAf^-Af^l <e. (4) 

In fact, the reverse is also true. Our investigations will re- 
volve around the fidelity alternative, which asserts that a chan- 
nel is geometry-preserving if and only if its complement 
Af c is approximately forgetful, with dimension-independent 
functions relating S and e. Thus, an isometry with two outputs 
can preserve geometry to at most one of them. Symmetrically, 
the isometry can be forgetful to at most one output. A sin- 
gle choice determines which output preserves geometry and 
which is forgetful; this choice gives the principle its name. 

The geometry preservation property, though much weaker 
than transmission of quantum information, must nonetheless 
be considered a way of preserving coherence: by virtue of the 
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fidelity alternative, geometry preservation cannot be cloned. 
Indeed, if a channel has multiple outputs, one of which is 
geometry-preserving, then the rest must be forgetful. 

Via the fidelity alternative, the many known examples of 
approximately forgetful channels that are not completely for- 
getful also provide examples of geo metry-preserving channels 
that are not correctable ll8ll9l [l2T - [l7ll . Most strikingly, it is pos- 
sible to preserve geometry while almost halving the number 
of qubits from input to output [18]. In that case, the geometry 
of the unit sphere in A is necessarily encoded into the eigen- 
vectors and eigenvalues of the much smaller output state on 
B. In contrast to quantum error correction, dimension count- 
ing reveals the mixedness of the output state to be crucial to 
preserving the geometry. Some of the geometry of the input 
state space of pure quantum states is thus faithfully encoded 
as noise in the output state. 

Moreover, the analogy with the quantum error correction 
duality can be made much stronger. There is a channel com- 
munication task very similar to quantum state transmission 
which is intimately related to geometry preservation: quan- 
tum identification lll8l.[l9Tl . 

Quantum identification is a cooperative communication game 
between two parties - conventionally called Alice and Bob 
- where Alice has a given quantum state that she encodes in 
some way into the channel, and Bob only wants to simulate 
measurements consisting of an arbitrary pure state projector 
and its complement, which can interpreted as performing the 
experiment asking "Is this the state?" lfl8ll . The idea is that 
Alice has an encoding channel £ and Bob has, for every pure 
state if, a POVM {D v , 1 - D v ) such that 



< e. 



(5) 



Such an object is called an e-quantum-ID code. (The name is 
adapted from the classical case I20I - I22I1 .) 

Note that Bob measures the output of the channel, but the 
quality of the code is measured by how well the statistics of 
this measurement approximate the statistics of the ideal mea- 
surement he wants to perform on the message state. While it 
may seem that this is an odd way of defining a quantum com- 
munication task, normal quantum error correction can also be 
described this way; namely, Bob wants to be able to simu- 
late all measurements on the message state. Clearly, if he can 
perform quantum error correction in the usual sense, then he 
can perform the simulation. But conversely, it follows from 
the methods of 123142511 that if he only has two measurements 
approximating generalized X and Z observables sufficiently 
well, he can build a quantum error correction procedure T>. 
Moreover, a quantum-ID code with e = is itself a quan- 
tum error correcting code; there is no difference between error 
correction and identification if both tasks are to be performed 
perfectly. Even the task of transmitting classical information 
is conveniently reflected in this framework. In that case, Bob 
only wants to simulate the measurement of the generalized Z 
observable. 

With this, one can define in the usual way a quantum-ID 
capacity Qid {M) of many uses of the channel as the highest 
rate at which qubits can be encoded and decoded as in Eq. (O 



with vanishing error - see Section Hill for details. Previously 
it was only known that for the noiseless qubit channel id2, 
QiD(id2) = 2, double the value of both the the quantum and 
classical transmission capacities 01811 . 

While reasoning directly about quantum identification 
codes has proved challenging, the duality between geome- 
try preservation and approximate forgetfulness provides a new 
approach to studying them. Up to some technical conditions, 
geometry preservation is equivalent to the existence of a quan- 
tum identification code. It is therefore possible to construct 
quantum identification codes by finding approximately for- 
getful maps. This approach is fruitful because destroying 
information is a comparatively indiscriminate task. Indeed, 
the analogous strategy has led to a number of straightforward 
proofs of the has hing bound on the quantum capacity of a 
quantum channel M24 126M28I1 . Classical data is not immune 
to analysis by purification either. The duality between privacy 
amplification and data compression with quantum side infor- 
mation has recently led to a proof in this spirit [29] of the 
Holevo-Schumacher- Westmoreland theorem on the classical 
capacity of a quantum channel |5j |30[] . 

With the fidelity alternative in hand, it is even possible to 
calculate a simple formula for an amortized version of the 
quantum identification capacity; it is exactly equal to the 
entanglement-assisted classical capacity of a quantum chan- 
nel. 



A. Structure of the paper 

Section|lI]contains the formal statement and proof of the fi- 
delity alternative. The duality is studied in more detail in Sec- 
tion |nl] where forgetfulness is shown to be nearly equivalent 
to quantum identification. In that section we provide a simple 
statement whose proof eliminates many technical difficulties, 
as well as a more flexible version that we prove from first 
principles. Section [IV] uses the flexible version of the equiv- 
alence to construct quantum identification codes for memory- 
less quantum channels. 



B. Notation 

We will restrict our attention throughout to finite dimen- 
sional Hilbert spaces. If A is a Hilbert space, we write S(A) 
for the set of density operators acting on A. Also, if A 
and B are two finite dimensional Hilbert spaces, we write 
AB = A (gi B for their tensor product. The Hilbert spaces 
on which linear operators act will be denoted by a superscript. 
For instance, we write ip AB for a density operator on AB. Par- 
tial traces will be abbreviated by omitting superscripts, such as 



ip A = trs tp A " ■ We use a similar notation for pure states, e.g. 



\iP) AB e AB, while abbreviating ^ AB = \ip)(^\ AB . We will 
write idyi for the identity map on S(A) and id2 for the identity 
qubit channel. The symbol l A will be reserved for the iden- 
tity matrix acting on the Hilbert space A and ir A — t A /\A\ 
for the maximally mixed state on A (where we denote by |^4| 
the dimension of the Hilbert space A). 
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The trace norm of an operator, ||X||i is defined to be 
tr |X| = tr v X^X. The similarity of two density operators 
ip and ip can be measured by the trace distance ^\\tp — 
which is equal to the maximum over all possible measure- 
ments of the variational distance between the outcome proba- 
bilities for the two states. The trace distance is zero for iden- 
tical states and one for perfectly distinguishable states. 

A complementary measure is the mixed state fidelity 



F((p,ip) = VWV' i = f tr sj vW>V^) , 



(6) 



defined such that when one of the states is pure, F(ip, ip) = 
tr ipip. More generally, the fidelity is equal to one for iden- 
tical states and zero for perfectly distinguishable states. We 
will make frequent use of the following fundamental inequal- 
ity between fidelity and trace distance of states Ir3ll Prop. 5]: 

1 



l-y/F&rf) < -l^-Vlli < y/l - F(ip,^). (7) 

Both measures can be extended to unnormalized states, but 
Eq. (O need not hold in that case. Further properties of the 
distance measures are collected in AppendixlAl 



H. THE FIDELITY ALTERNATIVE 

Our investigations will revolve around the duality between 
geometry preservation and approximate forgetfulness, which 
we call the fidelity alternative. The rigorous statement is as 
follows: 

Theorem 1 (Fidelity alternative) Let Af : S(A) -» S{B) 
be a quantum channel with complementary channel Af c : 
S(A) — > S(E). Approximate geometry preservation on B 
implies approximate forgetfulness for E. That is, 



implies , \(p) € A 



\\<P-il>\\i-\M<p)-tfy>)h<s 

\\Af c {ip)-Af c (ip)\\ 1 <4n/2^/ 4 . 



Conversely, approximate forgetfulness for E implies approxi- 
mate geometry preservation on B: 

V|V), \<p) G A \\N c {lp) - M c {i))\x < e implies 

V\ip), \(p) G A y-^-pSM-ffy,)^ <AV2e. 

Note that we have dropped an absolute value sign as com- 
pared to Eq. © since \\ip - VII l > ll-A^(v) ~N(ip)\\i holds 
automatically for all quantum channels Af. (See, for example, 
JH.) 

The duality is a straightforward consequence of two ba- 
sic results in quantum information theory. The first is that 
the ability to transmit classical data in two conjugate bases is 
equivalent to the ability to transmit entanglement. That obser- 
vation is the basis for the stabilizer approach to quantum error 
correcting codes l33ll . Here we will use a clean approximate 



formulation due to Renes [25]. The second result is the conti- 
nuity of the Stinespring dilation of a quantum channel, estab- 
lished by Kretschmann et al. @]. Here we only need a corol- 
lary, which can be interpreted as a bound on the information- 
disturbance trade-off. The theorem is stated in terms of the 
following norms: 



Definition 2 For a linear superoperator V : S(A) 
let 



S(B), 



ir 



i(fe) 



max llfidfe ® r)J L , 
ll*lli<i" 111 



where X is an operator on C fe <E) A. Define \\T\\ — 

sup fc ||r||o' c ' ) , the completely bounded trace norm [T(\] (also 
known as diamond norm 4771/ ). 

Note that the convexity of the trace norm implies that 
the supremum is achieved on a rank-one operator (if T is 
Hermitian-preserving, in fact on a pure quantum state). Since 
any operator on A can be "purified" by a system of dimension 
\A\, it follows that the supremum is achieved when k = \A\. 

Theorem 3 (Information-disturbance 1 7]) Let V : A — > 

B ® E be an isometric extension of the channel Af '■ S(A) — > 
S(B) and let Af c : S(A) — > S(E) be the complementary 
channel. Fix a state p G S(A) and let 1Z : S(A) — > S(E) be 
the channel taking all inputs to Af c (p). Then 

\m\\VoAT- id\\l < \\Af c - H\\« < 2 mf \\T> ° N - id^/ 2 . 
Both infimums are over all quantum channels. 

The proof of the fidelity alternative is a fairly routine matter 
of combining these results: 

Proof of Theorem [1] We begin by assuming approximate 
geometry preservation. Fix \ip) _L \ip) in A then set T = 
span(|^), Suppose that 

WAf^-Afimi^Wu-th-S 
for all |0 G A. Then if |x±) = ^(\<p) ± we have 



\M<p)-MM\\i>2-S 
W(X+)-M(X-)\\i>2-8. 



and 



We can therefore transmit data in two conjugate bases through 
Af, which implies that entanglement is also faithfully trans- 
mitted. In particular B25l Thm. 1] (with "guessing probability" 
1-5/2) implies that there exists a channel V : S(B) -t S(T) 
such that 

||(id2®23oj\/)*-*|li < 2v^, 
where |$) = -^(|O)|</5) + |l)|-0)). But trace norm monotonic- 
ity with respect to dephasing the first system then gives 

||(id 2 O 2? o AO* — *IU 

> |][|0X0|® [(2?oA0v-V»] 

+ |1X1|® [(poM)ip- <>p] II 



-IKPoAO^-^lli 



-w(voAf)^-ni- 
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Therefore, \\(T> o J\f)ip — <^||i < 4v5 and by changing 
the choice of dephasing basis, we can conclude that \\T> o 
■A/" — id^lll < aVS. Combining this with Lemma [T9l in 
Appendix |A] implies that \\T> o J\f - id 2 ||<> < 8y/S. The 
information-disturbance theorem (Theorem|3j applied with 7Z 
the map taking all states to N c {<p) then implies that for all 

|w) e T, 

11^(^-^(^)11! < 2(8VS)^ 2 - 4%/2<5 1 / 4 . 

Since T is an arbitrary two-dimensional subspace of A, how- 
ever, the inequality must hold for all \<p) and \uj) in A. 

For the converse, suppose that, for all states \ip),\i(j) € A, 
the inequality ||7V c (y) - 7V c (^)||i < e holds. Fix \tp) 
and l^) then let J\f c be the restriction of Af c to states on 
T = span(|y>), |V>))- Let 72. be the channel on S(T) that al- 
ways outputs Af c (ip). Then once more by Lemma [T9l in Ap- 
pendix lAl \\J\f c — TZ\\ < 2e. Using this time the lower bound 
from Theorem [3] there exists a channel T> : S(B) — > S(T) 
such that i||7?o7V-id||| < 2e. In particular, for all |w) G T, 

Applying the triangle inequality several more times gives: 



tflli 



> 



4V2e> WiVoAf^-ip^ + lKVoN)^- 
> l^-Vlli- IK© oAOte-VOHa 

\f-iP\\i-\Mv>-^)\\i, 



where the final inequality used that the quantum channel V 
cannot increase the trace norm. Rearranging the final expres- 
sion gives the desired inequality. □ 



HI. QUANTUM IDENTIFICATION 

Quantum identification allows a sender to transmit arbi- 
trary quantum states but only allows the receiver to perform 
a restricted set of measurements, namely tests to determine 
whether the transmitted state consists of an arbitrary target 
state. The receiver gets to choose the target state after the 
sender has transmitted, so the code must work for all targets. 
If the test can be performed perfectly, then quantum identifica- 
tion is easily seen to be equivalent to quantum state transmis- 
sion, but in the approximate setting, the tasks are not equiva- 
lent. 

Definition 4 1 18] An e-quantum-ID code for the channel M : 
S(A) — > S(B) consists of an encoding map £ : S{S) ~ > 
S(A) and, for every pure state \<p) G S, a POVM {D Vl 1 — 
D v ) acting on S(B) such that 



V|^},|^)eS tx{{Mo£)^)D v -\{<p\^ 



< e. 



observed outcome \<p)((p\ with probability |((/j|?/>)| 2 . The defi- 
nition therefore ensures that the receiver can simulate the mea- 
surement for all input and target states. 

Many variants of the definition have been proposed. In 
particular, one could imagine drawing a distinction between 
oblivious ID codes, in which the sender is only given a phys- 
ical quantum state to send, and visible ID codes, in which the 
sender knows the identity of the state she is trying to trans- 
mit lfl8ll . Entanglement assistance is also interesting and ex- 
ceptionally powerful in the visible setting 113411 . A different 
task that is nonetheless similar in spirit is to use quantum 
states as "fingerprints" for identifying classical messages in 
a model where pairs of messages are to be compared by a ref- 
eree J35|. For comparing quantum states, however, the simple 
definition considered here is arguably the most natural. 

If we integrate the encoding £ and noisy channel Af from 
Definition[4]into a single map with output B and environment 
E, we may think of the code Hilbert space S as a subspace of 
B®E. More formally, if we let V be the Stinespring dilation 
of JV o £, then V '■ S B <E) E and we can identify the code 
with a subspace of B ® E. This identification simplifies the 
notation and we will use it for the remainder of the paper. 

The main result of this section is a demonstration that a sub- 
space of B <E) E is a quantum-ID code for B iff it is approx- 
imately forgetful for E. (There is a small technical caveat to 
the statement: the reduced states on E must also obey a reg- 
ularity condition for the reverse implication to hold, but we 
will defer discussion of the details.) For the moment, let us 
begin by considering the relationship between quantum iden- 
tification and geometry preservation. 

Lemma 5 Let S C B ® E be a subspace of a tensor product 
Hilbert space that is an e-quantum-ID code for B. In other 
words, suppose that, for each pure state \ip) G S, there exists 
an operator < D v < 11 on B such that for all pure states 

|trV> B -D v - tr^| < e. 

Then, for all \ip), G S, 

F(<p,il>) < F(tp B ,Tjj B ) < F(<p, ip) + 4-v/e. 

Proof Consider the measurement (D v , 1 — D v ) and associ- 
ated channel M:p4 diag( tr pD v . 1 — tr pD v ) which acts 
on S(B). By applying the monotonicity of the fidelity under 
quantum channels to trg; and M, we get 

F(ip, (fi) < F{4, B ^ B ) < F(M(^ B ) ) M( l p B )) 



< 



< F{lp,(f) +2^+6 + 6, 



which proves the lemma. 



□ 



If the receiver had been able to perform the measurement 
CI vXvU 1 — \ f)(f\) on tne input state \tji), then he would have 



The fidelity is therefore approximately preserved by quan- 
tum identification codes. Geometry preservation is defined in 
terms of the trace distance, however, not the fidelity. While it 
is indeed the case that quantum identification codes preserve 
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geometry, the argument is somewhat more delicate because 
applying the measurement (D v , 1 — D„) causes a significant 
drop in the trace distance even as it leaves the fidelity nearly 
unchanged. Instead, Theorem [7] will allow us to infer that 
quantum identification codes preserve geometry by virtue of 
the fact that their complementary channels are forgetful. 

In order to succeed at quantum identification, the following 
lemma demonstrates that it is sufficient to be able to identify 
orthogonal states: 

Lemma 6 Let SC. B ® E be a subspace of a tensor product 
Hilbert space such that for \ip) £ S there exists < D v < 1 
acting on B satisfying 

tr ip B D v >l-6 and tr i) B D^ < 8 

whenever € S is orthogonal to \ip). Then S is a quantum 
identification code with error probability S + 2y/8. 

Proof Let \ip), \ip) € S be arbitrary and let \ip') be orthogonal 

to in span(|<^), \ip)). Write 

\1>) = a\<p)+l3\<f/). 
Expanding shows that tr ip B D v is equal to 

\a\ 2 trp B D v + \(3\ 2 tip' B D v 

+ af3ti\p){p\{D v <g> I) + a^tr\p')(p\{D v ® 1), 

which results in 

\txip B D v - \a\ 2 \ 

< \a\ 2 {l-trp B D v ) + \(3\ 2 trp /B D v 

+ 2\aP\\(<P\(D<P®*)\<S)\ 

< \a\ 2 (l-ixtp B D ip ) + \P\*ix<f/ B D v 

+ 2\a/3\yJ(ip>\(D v ®l)\ip') 
<5 + 2VS, 

where we have used the Cauchy-Schwarz inequality and the 
assumption that orthogonal states in S can be well discrimi- 
nated. □ 

Now we are ready to state and prove our main result on the 
duality between quantum identification and approximate for- 
getfulness. As with the fidelity alternative, we have chosen to 
prove the theorem by composing general purpose results for 
the purpose of pedagogical clarity, which leads to artificially 
poor scaling of the parameters. Readers concerned with opti- 
mizing the parameters should also consult Theorem|8] 

Theorem 7 (Identification and forgetfulness) Quantum-ID 
codes and forgetfulness are dual in the following quantitative 
sense. If a subspace S C B Cgs E is an e-quantum-ID code for 
B, then E is approximately d-forgetful: 



Conversely, if E is approximately S-forgetful, then geometry 
is approximately preserved on B: 

y\ip)M) e S \\ip- - \\ip B - ^ B \\ x < e := 4V2S 

If in addition, the nonzero eigenvalues ofip B lie in the inter- 
val [fx, A] for all \tp) € S, then S is an rj-quantum-ID code for 

Remark While it would be desirable to eliminate the eigen- 
value condition at the end of the theorem, the condition is 
fairly natural in this context. If the reduced states <p E are very 
close to a single state a E for all \ip) E S, then all the \<p) 
are very close to being purifications of a E , meaning that they 
differ from one another only by a unitary plus a small pertur- 
bation. If a E is the maximally mixed state or close to it, then 
the assumption will be satisfied. □ 

Proof For the first part, recall that if S is a quantum-ID code 
with error probability e, then for each pure state \tp) G S there 
exists an operator < D v < 1 on B such that for all pure 
states \<p), e S, 

\tT^ B D v - trr/^l < £■ 

Just as in the proof of TheoremQ] the hypothesis implies that 
data can be transmitted in two conjugate bases with guessing 
probability 1 — e. Running exactly the same argument as was 
made in that proof gives that for all \<p), \ip) € S, 

^ll^-V^Ui < 4V2(2e) 1 / 4 < 7c 1 / 4 . (8) 

The second part is just a restatement of one direction of 
the fidelity alternative, but it is a useful step on the way to 
the third part, which is more challenging since it requires the 
construction of the decoder, that is, the operators D v . 

Indeed, given \<p) E S, and arbitrary \ip) _L \<p) in S, we 
learn from the second part that 

H^-^lli > 2-4^25. (9) 

By Helstrom's theorem on the optimal discrimination of Lp B 
and ip B [36], there exists a projector P v ,0 on B such that 

tr cp B P v ^ > 1 - 2V2S, tr ip B P v ^ < 2V25. (10) 

The problem with using as the decoding is that this pro- 
jector may indeed depend not only on ip, but also on ip. Since 
the goal is to find a single projector that the receiver can use 
to identify (p that will work regardless of whether the input is 
ip or ip, that is unacceptable. Still, let us confirm first that if 
we manage to find one effect operator D v that can deal with 
all ip at once, then by Lemma|6]weTl be done. Our strategy 
for doing so will be to first extend Eq. (TToT > to all mixed states 
orthogonal to | ip) and supported on S, and then use a minimax 
argument to extract a single operator independent of ip. 

Lemma [T71 in Appendix lAl can be used directly to see that 
for all mixed states a supported on S and orthogonal to <p, 

F(<p B ,v B ) < ^maxF(p B ,i; B ) < 4^25^, 
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where the maximization is over all \tjj) G S orthogonal to 
\ip) and the second inequality is an application of Eq. (0 to 
Eq. (|9]l. Applying Eq. (0 a second time gives 

1 ,, 



all* 



| x > 1 - 2(2<5) 1 / 4 -. 



Applying Helstrom's theorem to ip B and a B yields a projector 
P n with 



tr^ B P CT - tva B P a > 1 - 2(2(5) 



1/4 



A 



Von Neumann's minimax theorem then ensures the existence 
of a saddle point in the following two-player game B37ll (see 
Ky Fan IB8ll for a more general version). One player se- 
lects < P < 1 while the other player selects a state a 
supported on S and orthogonal to (p. The strategy spaces 
are therefore closed and convex. The payoff function is 
1 — tr (p B P + tr a B P, which is linear in each argument. Thus, 
the minimax theorem guarantees that there exists an operator 
< D v < 1 such that for all a supported on S and orthogo- 
nal to ip, 

tr <p B D v > 1-2(2(5) x / 4 -, 
A 4 

tr a B D v < 2(2<5) 1 / 4 -, 
M 

and applying Lemma|6]finishes the proof. □ 

Unfortunately, Theorem [7] is not quite strong enough to 
prove our main result on the quantum identification capacity. 
To control the ratio of the largest to smallest eigenvalues of the 
coding states, we need to act on them by typical projectors that 
cause a slight distortion. To accomodate this complication, we 
will instead use the following slightly more flexible version of 
the converse that behaves better with respect to the distortion. 
In particular, the amount of distortion enters the bound on the 
quality of the quantum-ID code in a term independent of the 
eigenvalue constraint. That separation proves to be crucial be- 
cause the eigenvalues cannot be controlled independently of 
the distortion. 

Theorem 8 Let S C B ® E be a subspace and < X < 1 
an operator acting on B <E> E such that tr{XtpX^) > 1 — e 
for all \<p) € S. For any state \uj) € S, write lj = XuiX'. If 
there exists a state Q such that 



< 5 



with < (5, e < 1/15 and, in addition, the nonzero eigenval- 
ues offl E lie in the interval [fi, A], then S is an rj- quantum-ID 
code for j] := 3(30A(5 / n + Z^fl + A8) 1 ' 2 . 

Proof Let \ip) and \t/)) be orthonormal states in S. We will 
begin by showing that (p B and tp B can be effectively distin- 
guished. To this end, consider the states 



which form two orthogonal pairs. Then 

*fc = \p E + ± \(*b +tr fl |^|), 

X± = \p E + \$ E T \{t*B Mi>\ - tr B |^|), 
and, by assumption, 
1„ ~ 



~||^-^||i<<5 and -\\xZ-r-\\i<5- 



Combining these relations reveals that || tr B l^'XV'l ^ 

trs IV'X^HIi < 4(5, hence by the triangle inequality, 

|| tr_B I^XV'llli — 86. But this gives us, by virtue of 
Lemma|T6l 



F{(p B ,^ B ) < 64(5 2 



(11) 



To proceed as in the proof of Theorem|7J we need to show that 
any | ip) 6 S and mixed state a supported on the orthogonal 
complement of \ip) in S can also be distinguished. In order to 
apply Lemma[l7]in AppendixlAl we will show that the largest 
and smallest nonzero eigenvalues of ip B , or equivalently, <p E , 
are well-behaved modulo a little bit of truncation. Indeed, let 
O = {Of) and p = (pj) be the eigenvalues of fl E and (p E , 
respectively, in nonincreasing order. Then 



\0-p L < IW 



Define the set 



J={j ■ (l-7)Pj<Oj < (l+7)Pi>- 



Then 



implying that 



Fixing 7 = 1/2 implies that for each \ip) E S, there is a 
positive semidefinite operator (p B < (p B satisfying tr (p B > 
l—e—2(5 and whose eigenvalues lie in the interval [/i/2, 3A/2]. 

Now let \ip) € S and consider any state a = J7 q^i whose 
support lies in the orthogonal complement of in S. Since 
the states are in S, the truncation procedure of the pre- 
vious paragraph can be used to construct operators Let 
a = Qi^i- Then by Lemma [T71 



9A 



F{ip B ,a B ) < ^max^ 8 ,/^ 
9A 2 



A 1 

9A 2 

2 



/2 

-^64(5 2 = 



576A 2 <5 2 
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Both maximizations are overstates | -0) £ S such that (<p\ip) = 
0. The second inequality follows from the fact that (p B < Cp B 
(and likewise for ip) along with Lemma [18] while the third 
arises by substituting in the result of Eq. ( TTTb . Introducing 
one last decoration for our states, let Cp B — (p B j tv(p B and 
likewise for a. Applying Eq. © with attention paid to the 
fact that (p B and a B are not normalized gives 



IV. QUANTUM IDENTIFICATION CAPACITY 

While it might not be possible to design low error quantum- 
ID codes for any given channel, the situation becomes more 
promising if many uses of the channel are allowed. In analogy 
with classical and quantum data transmission, we can define 
asymptotic quantum-ID codes as follows. 



1 



> 1 



24X5 



1 



1 



25 



> 1 



30A<S 
A* 



where the final inequality uses that e < 1/15. Applying Hel- 
strom's theorem to (p B and a B implies that there exists a pro- 
jector P a such that 



tT(p B P„ - tra B P a > 1 



30X5 



Next we invoke von Neumann's minimax theorem, just as in 
the proof of Theorem[7] for the payoff function 1 — tr (p B P + 
tr a B P, with the strategy space of the second player the con- 
vex hull of the operators %/j , where \tp) € S ranges over states 
orthogonal to \ip). (The operators a B are not normalized but 
that will not cause any difficulties.) This provides an operator 



< D v < 1 such that 



ti<p B D v > 1 - 



30X6 



and 



tr a B D v < 



30X5 



/' 



(12) 
(13) 



But 



tvLp B D v -tv(p B D v \ 



B h 



< W" - <P~ Hi 

<ll^-^lli 

< 2Ve + 25 + 1 1 - (1 - e - 25) | 

< 3^ + 45, 



where the fourth line follows from the gentle measurement 
lemma (Appendix lAl Lemma EOt. the definition of (p B , and 
the fact that Cp B = (p B /(tr ip B ). Similarly, for any a B = 
Si Qi'i'F a convex combination of states arising from \ipi) € 



S perpendicular to \ip), 



tr a B D u 



tr a B D lf 



B 



<X>lh/>. 

i 

<E»(ii^ 

i 

< 2^fe + 25. 



4fh 



&\\x 



1^ 



^flli 



Combining these estimates with the outcome of the minimax 
theorem in Eq. ( fT2] i and Lemma|6]completes the proof. □ 



Definition 9 (Quantum-ID capacity fPUl ) A rate Q is said 
to be achievable for quantum identification over AT if for all 
e > and sufficiently large n, there are e-quantum-ID codes 
for Af® n with encoding domain S of dimension at least 2 n Q. 
The quantum identification capacity Qid (A/ - ) is defined as the 
supremum of the achievable rates. 

The capacity should be interpreted as the number of qubits 
that can be identified per use of the channel Af in the limit 
of many uses of the channel. The only nontrivial channel for 
which the quantum identification capacity was known prior to 
this paper was the identity channel: asymptotically, a noise- 
less qubit channel can be used to identify two qubits. That 
is, QiD^cb) = 2 lfl8ll . As we will see below, the theory of 
the quantum identification capacity is considerably simpler 
when the given channel Af can be used in conjunction with 
noiseless channels to the receiver. This obviously increases 
the capacity, so the interesting question is how much the use 
of Af increases the quantum identification capacity over what 
would have been achievable with the noiseless channels alone. 
When defining the achievable amortized rates it is therefore 
necessary to subtract off two qubits for every noiseless qubit 
channel used per copy of Af. 

Definition 10 (Amortized quantum-ID capacity) A rate Q 
is said to be achievable for amortized quantum identification 
over Af if for all e > and sufficiently large n, there are e- 
quantum-ID codes for idc C3> A/"®" with encoding domain S 
such that Q < —(log \S\ — 2 log |C|). where log = log 2 is the 
binary logarithm throughout this paper. The amortized quan- 
tum identification capacity Q\^(Af) is defined as the supre- 
mum of the achievable rates. 

Readers familiar with the identification capacities of classi- 
cal channels might be surprised to see that the dimension of a 
quantum identification code scales only exponentially with the 
number of channel uses, as opposed to doubly exponentially. 
The essential difference between the classical and quantum 
settings is that the number of distinguishable quantum states 
in dimension d already scales exponentially with d, which 
makes quantum identification a much more demanding task. 
Nonetheless, as we will see below, the amortized quantum 
identification capacity can be positive for some channels with 
zero quantum capacity, like the noiseless bit channel. One 
then finds that the dimension of the quantum identification 
code can scale super-exponentially with the number of qubits 
used to supplement the classical channel. 

The fidelity alternative is a very effective tool for study- 
ing the quantum-ID capacities. As a warm-up, the fact that 
the complements of quantum-ID codes are forgetful supplies 
a quick answer to an open question from 111 811 : 
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Theorem 11 If Af is an antidegradable channel, that is, if 
there exists channel T such thatAf = T~oAf c , then Qi£> (Af) — 
0. This is true in particular for the noiseless cbit channel 
id2. More generally, if the quantum capacity of the channel 
vanishes, Q(Af) = 0, then so does the quantum-ID capacity, 
Qm(.A0=0. 

Proof Given a quantum-ID code for the channel Af that 
encodes as little as one qubit, the channel Af o £ will be 
geometry-preserving if £ is the encoding map. Hence, by the 
fidelity alternative, the channel complementary to Af o £ will 
be approximately forgetful. But if Af is antidegradable, then 
so is Af o £ , meaning that the channel complementary loAfo£ 
can simulate Af o £. But then the complementary channel 
would be simultaneously forgetful and geometry-preserving, 
a contradiction. 

For the more general statement, we show the contraposi- 
tive: assume Qxd {Af) > 0, then for all e > and sufficiently 
large n, Af® n has in particular a 2-dimensional quantum-ID 
code S which is e-close to being forgetful for the environ- 
ment, by Lemma [5] But by Lemma [19] this means that the 
channel from the code qubit to the environment is arbitrarily 
close to a constant map in the diamond norm. At this point we 
can then invoke Theorem[3]of [7] on information-disturbance 
to conclude that the channel from the code qubit to B n can 
be arbitrarily well error-corrected. (Note that this argument 
is following our proof of the fidelity alternative; in particu- 
lar, any 2-dimensional subspace of a quantum-ID code is a 
quantum error-correcting code!) By the Lloyd-Shor-Devetak 
theorem on the quantum capacity (see |0]), this implies that 
there exists an input state for which the coherent information 
I(A H )B n ) > is positive, and hence Q(Af) > 0. □ 

As usual, quantitative statements about asymptotically 
achievable rates and upper bounds on the identification ca- 
pacities are naturally expressed in terms of entropies. For a 
bipartite density matrix <p AB , we write 

H(A) V = H(cp A ) = -txcp A log ip A 

for the von Neumann entropy of ip A . The mutual information 
of the state tp AB is defined to be 

I{A : B) v = H(A) V + H(B) V - H(AB) V 

while the coherent information and the conditional entropy 
are, respectively, 

I{A)B) V = H(B) V - H(AB) V 
H(A\B) V = H(AB) V - H(B) V . 

Our main theorem on the quantum identification capacities 
includes a concise formula for Qf^ that eliminates the op- 
timization over multiple channel uses. 

Theorem 12 (Quantum identification capacity) For any 

quantum channel Af, its quantum-ID capacity is given by 
Qm(Af) = sup„ ±Q$(Af® n ), where 

Qiu (AO = sup{/(A : B) p s.t. 1(A) B) p > 0}, 

\<p) 



where \cp) is the purification of any input state to Af and 
p AB = (id ® Af)(p, and where we declare the sup to be 
if the set above is empty. 

Furthermore, the amortized quantum-ID capacity equals 

QmCAO = su P I(A : B)„ = C E (Af), 
\v) 

the entanglement-assisted classical capacity of Af / li^/ . 

Remark It follows from Theorem Q~2] that the amortized 
quantum-ID capacity of a noiseless cbit channel is one. Rec- 
onciling this observation with Theorem QT| which asserts 
this channel's unamortized quantum-ID capacity is zero, re- 
veals that some amortized noiseless quantum communication 
is necessary to achieve Qf^ without determining how much. 
In fact, inspection of the proof of Theorem Q~2] reveals that, 
for the noiseless cbit channel id2, a zero rate of noiseless 
side qubits is sufficient to achieve the maximum value of one. 
These observations extend to cq-channels, so named because 
they consist of a destructive measurement resulting in classi- 
cal information, followed by the preparation of a state con- 
ditioned on the measurement outcome. For these channels, 
the entanglement-assisted capacity Ce is equal to the unas- 
sisted classical capacity C, also known as the Holevo capac- 
ity JMII3l- As a result, Qm(Af) = for all such channels 
even as Qf™(Af) = C(Af), the latter strictly positive for all 
nontrivial channels. The difference in all cases can be traced 
to a sublinear amount of free quantum communication in the 
amortized setting. 

This effect can be viewed as an instance of (un-)locking 
since the quantum-ID rate increases from strictly to an ar- 
bitrarily large amount by the addition of any positive rate of 
quantum communication, cf. J23L l4ll l42ll . Unlike the previ- 
ously known examples where a certain finite rate is always 
required, however, here an arbitrarily small rate of extra quan- 
tum communication is sufficient to bring about an unbounded 
increase in the capacity. □ 

The intuition behind the achievability of the rates in The- 
orem[T2]is quite simple. The structure of an amortized code 
is illustrated in Figure Q] Fix a state | tp) purifying any in- 
put to the channel Af and let \p) ABE be (1 <8> Utf)\ip), where 
Um is the Stinespring extension of Af. The encoding will em- 
bed the input into a random subspace of a typical subspace 
of A n tensored with ancillary spaces C and F, where C will 
consist of the amortized quantum communication and F the 
environment for the encoding. Since the encoding is into a 
random subspace, it will produce states highly entangled be- 
tween B n C and E n F. By arranging for E n F to be slightly 
smaller than B n C in the appropriate sense, one ensures that 
the states are indistinguishable on the environment E n F. By 
the fidelity alternative, they can therefore be identified by Bob. 
Letting R = — log |C| and / = — log \F\, the condition en- 
suring that E n F be "smaller" than B n C is roughly 

H(E) p + f<H(B) p + R, 

so / — R is chosen to be very slightly less than H(B) p — 
H(E) p . Moreover, measure concentration for the choice of 
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random subspace will make it possible to choose the coding 
subspace almost as large as the ambient space A n CF, which 
in qubit terms has effective size 

nH(A) p + nR + nf. 

The rate of the amortized code will therefore be 

H(A) P + R + f-2R = H(A) P + f-R 

^H{A) p + H(B) p -H(E) p . 

Since p is pure, H(E) p = H(AB) p which means that the rate 
is precisely the mutual information. 




Bob 



Environment 



FIG. 1. Structure of a quantum identification code. Uj^®n and Ve 
are the Stinespring extensions of the noisy channel J\[® n and the 
encoding operation £ . The receiver, Bob, has access to the channel 
output B" as well as C, which consists of nR qubits transmitted 
noiselessly from the receiver. (In the non-amortized setting, there is 
no C.) The encoding map £ is generally noisy, so part of its output 
is transmitted to the environment as F. 

The detailed proof of the achievability of the rates in The- 
orem [T2]builds on the techniques developed in Refs. lfl3ll and 
ll43ll analyzing the properties of generic quantum states. The 
proof will combine the following theorem, originally moti- 
vated by the foundations of statistical mechanics, with the du- 
ality between quantum identification and approximate forget- 
fulness formulated in Theorem[7]or, more precisely, its tech- 
nical variant Theorem [8] 

Theorem 13 (Random versus average states 1 43]) Let S be 

a subspace of B ® E, CI be the maximally mixed state on S, 
and X any operator acting on B ® E with ||AT||oo < 1- If 
\ifi) (z S is chosen according to the unitarily invariant mea- 
sure, then for all e > 

Pr {||trB XfiX f -\x B XipX^ > 77} < 77' 

where 



77 = e + \J ds/ds and 
rf = 2exp(-C*e 2 |S'|). 

Here C > is a constant, dE — | supptr^ XX^| is an up- 
per bound on the dimension of the support oftrg XilX' and 
ds = 1/ tr[(trE XJIXT) 2 ] can be thought of as the effective 
dimension of B. 



Proof This is a slight modification of II43L Thm. 2]. In 
the original, the theorem bounds |j tr^ il — trs f\\i under 
similar hypotheses but 77 includes a correction dependent on 
tr XilX^ . The correction disappears if the argument is ap- 
plied to || tiB XVlX^ — trs XtpX^li instead under the as- 
sumption that || X Hoc < 1, which ensures that the map p 1— > 
XpX^ is 1-Lipschitz. □ 

In order to use Theorem Qj] to make statements about ran- 
dom subspaces, we will use the following lemma 

Lemma 14 Let f be an real-valued function on CP d ( identi- 
fied with rank one projectors acting on <C d ) and suppose that 
f is a-Lipschitz with respect to the trace norm. Let p be the 
unitarily invariant measure on CP d and p the unitarily in- 
variant measure on the space of k-dimensional subspaces of 
C d . If 



>»»}<$(<*) 



then 



p<S: max f (£_) > (I + a)n } < 



2 k- 



9(d)- 



Proof This is a standard discretization argument. Fix a k- 
dimensional subspace So C C d . According to Ref. [8], there 
is a trace norm 77-net M for the rank one projectors on Sq of 
cardinality no more than (5/rj) 2k . If U is distributed accord- 
ing to the Haar measure v, then USq is distributed according 
to the unitarily invariant measure. So, we find by the triangle 
inequality that 



v < U: max 

\Oes , (£|f>= 



< v < U: max 

ie>eAf,(£|€>=i 



V 



■Ik 



n{\0;f(0>vh 



where the second inequality is just the union bound over ele- 
ments of the net. □ 

The following theorem collects the facts we will need about 
type and typical projectors. We omit their definitions, which 
will not be required here and can be found in Ref. |44ll . 

Theorem 15 (Typicality) Let \p) e A ® B ® E and set 

IV-') = \p)® n - For any S, e > sufficiently small there exist 
projectors W B , nf and Tlf on B® n and E® n , respectively, 
and a projection Ilf onto a fixed type subspace of A n such 
that the states 



iB 



and 



B ®i E \^P) 
n s (g)nfnf|v>) 



in? 



,B 
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satisfy the following conditions for X — A n , B n , E n and suf- 
ficiently large n: 

1. ipf = n^/ranknf. 

2. \\ip t - ipt\\i < e. 

3. tr[(^f) 2 ] < 3(1 - 3 £ )-i2-»[^(^)p- c5 ]. 

5. The largest eigenvalue ofipf is bounded above by (1 — 
3 e )-i2-«[tf( E ) P - c5 ]. 

6. The ratio of the largest to the smallest nonzero eigen- 
value of 'ip E is at most 2 2nS . 

where H A and H E should respectively be understood to be 
H A and HfUf in property 4, and c > is a constant. 

Proof If H§ is removed and property 6 omitted, then the 
theorem is precisely a result proved in Ref. I4411 . with nf the 
typical projector for p on E n . U E will be a projector that 
removes all eigenvalues of the reduced density operator on 
E n below the stated threshold. Let 



10 = 



nf <g> n B <g> nf \^) 



The largest eigenvalue of ^ E is bounded above by (1 — 
3e)~ 1 2~™[ H (- E )' , ~ c ' 5 l according to property 5 as stated above 
and the state's rank is at most 2"l ff ( B )p+' 5 ] by property 4. Ap- 
plying Lemma |2T| to the eigenvalues of £ E reveals that the 
sum of all eigenvalues less than or equal to 2~ 2nS / rank£ B 
is at most 



-2nS 



1 - 3e 



< 2 



-7l5 



for sufficiently large n. We can therefore let H E be the orthog- 
onal projection onto the direct sum of the eigenspaces of £ B 
corresponding to eigenvalues larger than 2~ 2nS / rank£ £ . 
Let A be the largest eigenvalue of £ E . The ratio of the largest 
to the smallest eigenvalue after the application of H E will be 
at most 



A 



-2nS 



< 



A 



/rank^f* " 2~ 2nS ■ A 



A redefinition of e completes the proof. 



= 2 



2nS 



a 



Proof (Direct coding part of Theorem [12) The regular and 
amortized cases can be handled simultaneously. Fix an in- 
put state ip as in Theorem [T2l let \p) ABE be a purification of 
(id® Af) (p and let = \p)® n . To construct the code, we will 
need to project ip A " to a type subspace having favorable prop- 
erties, ipf B is the Choi-Jamiolkowski state for the channel 
Af® n restricted to the type subspace A t defined by the pro- 
jector U A . Call this channel Aft, write U t for its Stinespring 
dilation, and consider Aft <8> id c (g id F . C will play the role 
of the noiseless channel from Alice to Bob in the case of the 



amortized capacity and F will represent quantum information 
discarded by Alice at the encoding stage. Our code will con- 
sist of a subspace of S' of At <g C (g F selected according to 
the unitarily invariant measure, which then defines a subspace 
S of (B n (g)(7)(g) (E n <g) F). Our aim will be to show that 
S is likely to be approximately forgetful for E n (g) F when C 
and F are chosen appropriately, allowing for an application of 
Theorem[8] 

Let fl = if>f nEn <g> 7r c <g) 7r F be the image under U t <8 t c <g> 
1 F of the maximally mixed state on A t (g C <g> F. (Recall that 
tt z denotes the maximally mixed state on Z.) Define \ip t ) as 
in Theorem[T5land let Cl = tp B " E " (g ir c <g> ir F . Then 

4> B " En = (n B ® n 2 nf " B "(n s ® nf nf ) 

so for X = n B (g nfnf , Theorem[T3l states that a randomly 
chosen state \oj) in ?7t(A t ) <E> C <E) F will satisfy 



Pr 



ft 



E n F 



> - 

i ~ 2 



< 



for £> = XojX^ and where, for any v > 0, 



i = j/ + ^rankpf nf (g 1 F ] • tr[(^f " <g ^ c ) 2 ], 
?/ = 2 cxp ( - C^ 2 |A t <g C <g F|) . 

We will fix {/ to be v = 2~ 3nS . So by LemmafT4l a random S 
in Ut (A t ) (g C(g i* 1 chosen according to the unitarily invariant 
measure will satisfy 



Pr 

s 



max 

\u,)es 



Q i E-F_~E-F 



> 77 



< 2 
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2|S| 



exp(-Cz/ 2 |A 4 (g)C(g)F|) 



since the function u> i-» \\il EnF — uj E " F \\iis 1 -Lipschitz with 
respect to the trace norm. For convenience, let \F\ — 2 n * and 
| CI = 2 nR . Since \A t \ > 2 n W A ^»- 5 \ choosing \S\ to be 

2 n(H(A) p + R+f-8S] wi jj j ea( j tQ 



max 



< ?7 



(14) 



with high probability for sufficiently large n provided rj de- 
cays at most exponentially with n. 

Now let us determine how to choose / and R in order to 
ensure a small value for rj. Observe that by properties 3 and 4 
in Theorem[T5l 

ranknf nf nf <g t F < 2»^( F )p+ d "+/l and 
tr^f g ^l c ) 2 ] < 3(1 - 3c)- 1 • 2 -^ H{B ^- c5 - R y 

Therefore, 

T] < v + 3 • 2 ,l [- f/ ( B )"- H ( B )p+-f- i? +( 1 + c )' 5 ]/ 2 

provided e is chosen smaller than 1/15. There are two cases 
to consider: 
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Case 1. First suppose that 1(A) B) p > or, equivalently, 
that H(E) p < H(B) p . Under these circumstances, amorti- 
zation is not required. Choosing R = and / = H(B) p — 
H(E) p - (7 + c)5 leads to 77 < v + 3 • 2- 3nS < 4 • 2- 3nS . 
The rate of the associated code will be 

Q = -\og\S\ 

n 

= H(A) p + R + f -85 

= H(A) P + H{B) P - H{E) P - (7 + c)S - 86 

= I(A : B) p - (15 + c)S. 

Case 2. Now suppose that I(A)B) p < so that H(E) p > 
H(B) P . In this case we set R = H(E) P - H(B) P + (7 + c)6 
and / = to again achieve 77 < 4 • 2~ 3ll<5 . This time, the rate 
of the code will be 

Q = -\og\S\-2R 
n 

= H(A) p +R + f-86-2R 

= H(A) p + H(B) p - H(E) p - (15 + c)8 

= I(A : B) p — (15 + c)S. 

We have established that the subspace S corresponds to a 
code of the correct rate. Applying Theorem [8] to £1 and the 
states in S with X = Il B <g> l c ® Ilf Ilf ® l F will complete 
the proof. Recalling that the ratio of the largest to the smallest 
nonzero eigenvalues of Q E F is at most 2 2n<5 , the theorem 
asserts that S is a quantum-ID code with error probability at 
most 

3 (30-2 2m5 -(4-2- 3m5 ) + 4Vi) 1/2 , 
which can be made arbitrarily small for sufficiently large n. 

□ 

Proof (Converse for Theorem[l2) We will address the reg- 
ular and amortized capacities at the same time. Consider an 
amortized quantum-ID code for n copies of Af as illustrated 
in FigureQ] The Stinespring dilations of Af® n and £ together 
have three output registers: one for the channel input, one for 
the transmission to Bob and one going to the environment. 
Abbreviating B = B n C and E = E n F in Figure Q] the 
quantum-ID code is equivalent to a subspace S C B ® E, 
and we can apply our lemmas. 

A key observation is that for any pure state ensemble 
{Px, fx} on S decomposing the maximally mixed state, 

H{B) > H(B\X) = H(E\X) = H(E) - o(n). (15) 

The first inequality is just the concavity of the entropy func- 
tion while the first equality follows from the fact that ip x is 
pure on BE. The final relation is a consequence of Theo- 
rem|7] the fidelity alternative implies that if states can be iden- 
tified on B then they must be indistinguishable on E. Conti- 
nuity of the von Neumann entropy in the form of the Fannes 
inequality IH5I1 shows the correction to be o(n). Thus, sending 
one half of a maximally entangled state $" 4S between S and 



an auxiliary space named A into the circuit of Figure Q] we 
obtain a multipartite pure state ^j abe with respect to which 

log \A\ = H(A) < H{A) + H(B) - H(E) + o(n) 

= I (A : B) + o{n) 

= I(A : B n ) + I (A : C\B n ) + o(n) 

<I(A:B n ) + 2log\C\+o(n). 

Therefore, the amortized quantum identification capacity 
is bounded above by lim^^oo ^g{M® n ) where g (Af) = 
max| v ) I (A : B) p for p = (id ® Af)f- It is well-known, 
however, that g(N® n ) = ng(Af) so the limit is not neces- 
sary in. 

On the other hand, in the non-amortized case, |C| = 1, and 
the rate of the code is bounded above by hl(A ■ B n ) + o(l). 
On the other hand, Eq. (fTBT l above yields 

I(A)B n ) = I{A)B) = H(B) - H(E) > -o(n), (16) 

which is almost what we need, except that the claim of The- 
orem [12] requires strictly positive coherent information. We 
will achieve this by modifying the input state in such a way 
that the coherent information becomes strictly positive and all 
other entropic quantities change only by a sublinear amount 
(in n). 

To this end, note that if Qm(N~) — there is nothing 
to prove, so we shall assume Qm(Af) > 0, in which case 
Q(AT) > by Theorem QT| Hence, fix a k and the purifica- 
tion \4>) of an appropriate input state to J\f® k , such that with 

respect to a A ' B " = (id ® N m )4>, I(A')B k ) a > 1, and let 
t = [max{0, — I(A)B n )^}\ +1. Hence, considering block 
length N = n + kt and the input state $ <g) (j)® 1 to Af® N , re- 
sulting in the state uj aa ' 1bN = (id ® M® N )(^ <g> (j)® 1 ), with 
respect to which we have 

log \A\ < I (A : B n ) + o(n) < I{AA' e : B N ) + o{N), 
I(AA')B N ) > 1 > 0. 

As N = n + o(n), this shows indeed that sup„ ^Q|d {N® n ) 
is an upper bound on all achievable rates. □ 



V. CONCLUSION AND OPEN QUESTIONS 

The fidelity alternative states that geometry preservation 
and approximate forgetfulness are complementary properties, 
much like quantum data transmission and complete forgetful- 
ness. Subject to some technical conditions, geometry preser- 
vation is itself equivalent to quantum identification, an opera- 
tional task very much in the spirit of quantum data transmis- 
sion but strictly weaker. Just as analyzing complete forget- 
fulness has proved a versatile and effective tool for studying 
asymptotic quantum error correction, approximate forgetful- 
ness provides a new approach to asymptotic quantum iden- 
tification. Indeed, by focusing on approximate forgetfulness 
of the complementary channel, we have established that the 
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amortized quantum identification capacity is exactly equal to 
the entanglement-assisted capacity. 

The fidelity alternative suggests a number of possible exten- 
sions, such as asking what happens if geometry is preserved 
not only for pure states but for higher rank mixed states. 
Would such a property have an operational interpretation and 
corresponding interpretation in terms of a form of forgetful- 
ness intermediate between the weak form studied here and 
complete forgetfulness? It would also be interesting to under- 
stand geometry preservation as a type of pseudo-isometry Ir46ll 
from projective space to the Grassmannian of subspaces cor- 
responding to the supports of the mixed output states. 

Meanwhile, Theorem [T2l poses an entertaining and poten- 
tially deep puzzle: why do amortized quantum identification 
and entanglement-assisted classical communication result in 
the same capacity in the absence of any known operational re- 
lationship between these tasks? The theorem also leaves open 
the important problem of evaluating the quantum identifica- 
tion capacity formula in the unamortized case. Similarly, the 
theorem fails to determine how much extra quantum commu- 
nication is necessary to achieve the amortized capacity. In 
particular, does there exist a channel where the required rate 
is strictly positive? We do expect this to be true, but have been 
unable to establish it rigorously. 
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Proof This is a straightforward calculation: 

||tr B 1^1 Hi = n^*! I tr ( trs IvXV'D-X'l 

= max |tr (tre IvXV'l) U\ 

U unitary 

= max |tr \(f)(tp\(i <8> U)\ 

U unitary 



max 

U unitary 



y/F((l®U)<p(l®W),ll>) 



invoking, successively, the duality between trace and sup 
norm, the fact that the maximum is always attained at a uni- 
tary, the defining property of the partial trace, and in the last 
line Uhlmann's relation 0H. □ 

The following lemma provides conditions under which 
mixing preserves near-orthogonality. 

Lemma 17 Let p and o~i, for all i, be states on the same 
Hilbert space such that there exist projectors P and Qi of 
rank < r, and pP < p < XP, pQi < <Ji < XQi such that 
pr < 1. If furthermore for all i, F(p,ai) < e, then 



F(p,w)<S:=e 



for every a — . PiO~i in the convex hull of the <7,. 
Proof We use the definition of the fidelity to first obtain 



e > f tr y/y/paiy/p) > p 2 {txPQiPf . 
Invoking the definition again, we now get from this 



! - Atr JY,t> iPQiP 



< \r .1 'V Pi — pti PQiP 



< XrJ— < Ve~, 



pr p 

using the concavity of the square root twice in turn 14911 . □ 



Appendix A: Miscellaneous Facts 

The following results were used in various proofs but have 
been collected here so as not to distract from the main line of 
argument in the paper. This first relation provides a convenient 
way to calculate mixed state fidelity: 



Lemma 16 For pure states tp,ip on a bipartite system B ® E, 
F{ V > B ^ B ) = ||tr B |^l||i- (AD 



Lemma 18 Let < p < p and < a < a. Then F(p 7 a) < 
F(p,a). 

Proof Denoting unitary congruence of matrices (in particular 
having the same spectrum) by ~, we have 

\/l>^\fp < \fp<y\/l> ~ \fap\fo < V°pVv ~ Vp^Vp- 

Hence, since the square root is operator monotone [49] and the 
trace is invariant under unitary basis change, tr y y/pa^/p < 
tr y/y/pvy/p, completing the proof. □ 

The next lemma constrains the increase of the maximal out- 
put trace norm when tensoring with a fixed-size identity trans- 
formation: 
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Lemma 19 Let T : S(A) — > S{B) be a linear supe rope rata r. 
Then for any t any positive integer, 



Proof Write X, an operator on C* ® A such that ||X|| i < 1, 
in its singular value decomposition as J^ . Sj\vj)(wj\, with 
< Sj < 1 and (vj\vk) — (wj\wk) — Sjk- By convex- 
ity (triangle inequality), ||r||^ is attained with a rank-one 
X = \v)(w\, and for the following fix Schmidt decomposi- 
tions \v) = Y.k a kW)\fk) and \w) = Y,i Pt\9t)\hi) ■ Then, 



||(id t ^r)^!^ = IKid^ni^Hlli 

(id* ® T) I ak/3i\e k )(gi\ ® \fk)(h e \ 

\ ki 

< «*A||(idt ® r) (lefcX^I ® l/fcX^DL 



fef 



= J2 a ^e\\r(\f k )(h e 



lx<t||r||P, 



A-/ 1 



where the first step is just the triangle inequality and the next 
follows from the fact that ||X||i = J2j s j — !• The final 



inequality uses the fact that X)t=i a k an d £;=i ft ^ both 



bounded above by \/£ since 



a 2 



1011 



1. 



□ 



Remark The factor t is optimal, as the example of the matrix 
transposition shows where the bound of the lemma becomes 
an equality. □ 

Lemma 20 (Gentle measurement |50-52]) Let p be a state, 
and < X < 1 be an operator on some Hilbert space, such 
that txpX>l-e. Then, \\p - s/XpVX^ < □ 

The following, final, lemma is used to argue that the small 
eigenvalues of a density operator can be discarded without 
causing much disturbance. 

Lemma 21 Let (pi,p2, ■ ■ ■ ,p r ) be a probability density with 
Pi > Pi+i for all i and let x — {i]Pi < D/r} for some 
0<D<l.Then,^ iex Pi<D. 

Proof Since evidently \x\ < r, 

x , D D 

22Pi ^ \x\j<rj=D, 

iex 



and that's it. 



□ 
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